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Expression of Glycosyltransferase in Aspergillus 
Technical Field of the Invention 

The present invention is directed to increased secretion of desired polypeptides (particularly 
glycosyltransferases) from filamentous fungi (particularly Aspergillus). The invention discloses 
DNA sequences, vectors, fusion polypeptides, and processes for obtaining enhanced 
production and secretion levels of the desired polypeptide. More particularly, the invention 
discloses DNA sequences, vectors, fusion polypeptides and processes for the increased 
secretion of sialyltransferase, galactosyltransferase and fucosyltransferase in Aspergillus, 
preferably iA. n/ger var. awamori. 

Background of the Invention 

One of the earlier successes of recombinant DNA technology involved the intracellular 
expression of the A and B chains of insulin in bacteria as carboxyl fusions to p-galactosidase. 
Goeddel. D.V. et al. (1979) Proc. Natl. Acad. Sci. USA 76:106-110: Johnson, I.S. (1983) 
Science 219:632-637. Since then, numerous examples have been described for the expression 
effusion polypeptides comprising, in part, a heterologous polypeptide. Marston, F.A.O. (1986) 
Biochem. J. 240:1-12 summarizes the production of heterologous polypeptides in £. coli. As 
described therein, a number of heterologous polypeptides have been expressed intracellulariy 
as fusion polypeptide in E. colL In addition, heterologous polypeptides have reportedly been 
secreted into the periplasmic space of such microbes by fusing the heterologous polypeptide 
with a signal sequence. In some cases, the heterologous polypeptide was secreted from E. coli 
into the culture medium when expressed with a signal sequence of bacterial origin. In those 
cases where a heterologous protein has been expressed as a fusion with the entire native 
protein of the host bacteria, the rationale was primarily to increase stability or ease the 
purification of the fusion polypeptide. 

For example. Scholtissek, S. et al. (1988) Gene 62:55-64, report the expression in E. coli of a 
triprotein consisting of bacterial p-galactosidase, a collagenase recognition site and the single 
stranded DNA binding protein from E. coli. The p-galactosidase portion of this fusion 
polypeptide reportedly was used to purify the fusion polypeptide from a coide cell lysate by 
affinity chromatography on APTG-Sepharose. The single stranded DNA binding protein from E. 
co// was thereafter isolated from the fusion polypeptide by reacting the collagenase recognition 
site with collagenase. Similariy. Smith, D.B. et al. (1988) Gene 67:31-40 report the bacterial 
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expression of a vector encoding a fusion polypeptide consisting of glutathione S-transferase 
fused at its C-terminus with a recognition site for blood coagulation factor Xa which itself is fused 
to either of two heterologous polypeptides corresponding to different antigens of P. falciparum. 
In another example, Guan. C. et al. (1988) Gene 67:21-30 report the expression and purification 
of fusion polypeptides consisting of maltose binding protein fused either to p-galactosidase or 
PsW endonuclease, and a fusion protein consisting of the bacterial phoA signal, maltose binding 
protein and phoA protein. In the former cases, the fusion polypeptides were extracted from 
crude bacterial lysates by affinity chromatography on cross-linked amylose whereas, in the 
latter, the fusion protein was obtained from the periplasmic space after spheroplast formation 
and affinity chromatography on cross-linked amylose. 

The expression of fusion polypeptides in yeast has also been reported. For example, Cousens, 
LS. et aL (1987) Gene 61:265-275 describe a fusion polypeptide consisting of a human 
superoxide dismutase-human proinsulin fusion protein with a methionine residue at the junction 
of the two proteins. Superoxide dismutase is an intracellular protein and the fusion polypeptide 
was reportedly expressed as an insoluble inclusion body within the yeast expression host with 
incorrect disulfide bonds. After sulfitollysis proinsulin was reportedly purified, renatured and 
processed to yield insulin after cleavage of the methionine residue with cyanogen bromide. 

United States Patent No. 4,751,180 to Cousens et al. states that a polypeptide of interest may 
be obtained in high yield from an expression host, such as yeast, when the polypeptide of 
interest is expressed as a completely heterologous fusion polypeptide. One of the heterologous 
polypeptides is produced in high yield in the expression host typically in amounts greater than 
five percent of the total protein produced by the host. The only high yield heterologous 
polypeptide disclosed, however, is that of the intracellular protein human superoxide dismutase 
which is fused to either proinsulin or lgF-2. The specification also states that a secretory leader 
and processing signal may be included as part of the fused polypeptide. No example is 
provided which indicates that secretion would be obtained and, if obtained, would be at levels 
higher than that which have been obtained using a fusion construction which detected the high 
yield heterologous protein in a fusion consisting of a secretory leader sequence fused to only 
the polypeptide of interest, e.g., proinsulin or insulin-like growth factor (lgF-2). 

Heterologous gene expression has also been reported in filamentous fungi. For example, 
Christensen, T. et al. (1988) Bio/T echnoloqy 6:1419-1422 have reported an expression vector 
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utilizing the a-amylase promotor from A, oryzae to express the prepro form of aspartyl 
proteinase from the filamentous fungus Rhizomuchor miehei. When expressed in A. oryzae, 
aspartyl proteinase was obtained from the culture medium. Gwynne, D.L et al. (1987) 
Bio/Technology 5:713-719 report the expression and secretion of human interferon and 
bacterial endoglucanase from filamentous fungi by expressing these genes \Mth either a fungal 
glucoamylase signal or a synthetic consensus signal sequence. 

US Patent 5,364,770 and EP Patent 0 215 594, both incorporated herein by reference, teach 
the expression and secretion of various heterologous polypeptides, such as chymosin, in 
filamentous fungi, and particulariy Aspergillus. 

Upshall. A. et al, (1987) BioyTechnoloQv 5:1301-1304 report the expression and secretion of 
human tissue plasminogen activator by expressing the gene encoding the pre-fomi of t-PA in a 
filamentous fungus. Further, Tumbuli, I.F. et a!. (1989) Bio/Technoloqv 7:169-174 report an 
attempt to express and secrete bacterial enterotoxin subunit B from filamentous fungi. No 
secreted material, however, was detected. 

Bovine prochymosin has reportedly been expressed in Escherichia coli, the yeasts 
Saccharomyces cerevisiae and Yarrowia lipolytica, and in filamentous fungi by the inventor in 
Aspergillus species. In E. coli prochymosin, with the first four amino acid residues replaced by 
an amino-terminal fragment of the trpE gene, has reportedly been produced under the control of 
the trp promoter (Nishimori, K. et al. (1984) Gene 29:41-49). The fusion protein accumulated as 
inclusion bodies in the cytoplasm but after appropriate extraction conditions could be activated 
to yield mature chymosin. 

Moir etal. (1985) (in Developments in Industrial Microbiology, Vol. 26, Undericofler, LA. (ed.), 
Society for Industrial Microbiology. Ariington, VA, USA) described intracellular production of 
prochymosin in S. cerevisiae. The protein was synthesized with various segments of 
phosphoglycerate kinase, triosephosphate isomerase or galactokinase attached to the amino 
terminus, allowing increased production compared to direct expression from the same 
promoters. It was suggested that the increase in production was due to more efficient 
translation of the mRNA. Moir et al. also reported secretion of prochymosin from S. cerevisiae, 
in the form of a fusion with the first few residues of invertase or alpha factor. The extracellular 
prochymosin was activated at low pH to give mature chymosin despite the additional amino 
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acids on the prosequence. Similarly, activatable prochymosin was secreted from the yeast Y, 
lipolytica with either 14 or 90 residues of native alkaline extracellular protease attached to the 
amino terminus (Franke, A.E. et al. (1988) in Developments in Industrial Microbiology, Vol. 29, 
Pierce, G. (ed.), Society for Industrial Microbiology, Ariington, VA, USA). In this report, no more 
than about 20% of the amino terminus of the protease was used to generate the fusion 
-polypeptides and no apparent advantage accrued from expression as fusion polypeptides. 
Active calf chymosin has also been produced in the filamentous fungus Trichoderma reesei 
(Harkki, A. et al. (1989) Bio/T echnology 7:596-603. The cellobiohydrolase I gene (cbhl) 
promoter and terminator regions were employed and four different constructions were made 
employing different signal sequences fused to prochymosin cDNA. Either the chymosin signal 
sequence, cbhl signal sequence, a hybrid cbArZ/chymosin signal sequence or the cbhl signal 
sequence plus 20 amino adds of mature cbhl were fused to the amino terminus of 
prochymosin. Slightly better production was obtained from the latter construction although 
insufficient numbers of transformants were examined to confirm this. Secretion was inefficient 
with approximately 66% of the chymosin-derived material remaining within the cell of 
transformants regardless of the type of vector construction used. 

The glaA gene encodes glucoamylase which is highly expressed in many strains of Aspergillus 
n/ger and Aspergillus awarvori. The promoter and secretion signal sequence of the glaA gene 
have been used to express heterologous genes in Aspergilli including bovine chymosin in 
Aspergillus nidulans and A. awamori as previously described by Cullen, D. et al. (1987) 
Bio/TechnoloGv 5:713-719. In the latter experiments, a variety of constructs were made, 
incorporating prochymosin cDNA, either the glucoamylase or the chymosin secretion signal and, 
in one case, the first 1 1 codons of mature glucoamylase. Maximum yields of secreted chymosin 
obtained from A. awamori v^ere below 15 mg/L in 50 ml shake flask cultures and were obtained 
using the chymosin signal sequence encoded by pGRG3. These previous studies indicated 
that integrated plasmid copy number did not con^elate with chymosin yields, abundant 
polyadenylated chymosin mRNA was produced, and intracellular levels of chymosin were high 
in some transformants regardless of the source of secretion signal. It was inferred that 
transcription was not a limiting factor in chymosin production but that secretion may have been 
inefficient It was also evident that the addition of a small amino-terminal segment (1 1 amino 
acids) of glucoamylase to the propeptide of prochymosin did not prevent activation to mature 
chymosin. The amount of extracellular chymosin obtained with the first eleven codons of 
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glucoamylase, however, was substantially less than that obtained when the glucoamylase signal 
was used alone. 

USSN 08/318,491 and EP Application WO90/15860, incorporated herein by reference, disclose 
the enhanced secretion of desired polypeptides by and from filamentous fungi including fusion 
DNA sequences, expression vectors containing such DNA sequences, transformed filamentous 
fungi, fusion polypeptides and processes for expressing and secreting high levels of such 
desired polypeptides. Specifically described is the enhanced secretion of chymosin from 
filamentous fungi including fusion DNA sequences, vectors containing such DNA sequences, 
transformed filamentous fungi, fusion chymosin polypeptides and processes for expressing and 
secreting high levels of chymosin. 

Despite these publications regarding the expression and secretion of various polypeptides from 
various filamentous fungi, including the production of desired polypeptides in filamentous fungi 
as fusions, there is a need to provide enhanced production of certain desired polypeptides, 
particulariy glycosyltransferase enzymes, from Aspergillus, and particulariy A. niger. The 
tetrasaccharide sialyl Lewis x senses as a ligand for a family of cell adhesion molecules, known 
as selectins, which mediate the recmitment of neutrophils and other leucocytes to sites of: 
inflammation and tissue injury. An oligosaccharide mimicking sialyl Lewis x ((Cylexin® to be 
commercialized by Cytel. San Diego, CA) is being synthesized by combined chemical and 
enzymatic means for testing as a treatment for inflammatory diseases. Enzymatic synthesis 
involves using glycosyltransferases such as a-2,3-(N) sialyltransferase (ST). p-(1 .4) galactosyl 
transferase (GT) and a-(1.3) fucosyltransferase (FT) in conjunction with other enzymes that 
regenerate the donor sugar nucleotide substrate. Commercial scale production of the enzymes 
GT, FT and ST is difficult and. therefore, the present inventors have proposed to produce such 
transferases as fusions in an Aspergillus system. Using a-2.3-(N) sialyltransferase as a 
prototype for glycosyltransferase production, a-2,3-(N) sialyltransferase activity was tested in a 
variety of systems, including baculovims, mammalian cells and filamentous fungus, using 
multiple vector constructs. The filamentous fungus Aspergillus niger war, awamon\ 
advantageously, was found to produce consistent, high levels of a-2,3-(N) sialyltransferase that 
facilitates cost effective synthesis of the complex carbohydrates. 

Accordingly, an object of the present invention is to provide for the expression and enhanced 
secretion of certain glycosyltransferases in Aspergillus, including fusion DNA sequences. 
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expression vectors, transformed host cells, fusion transferases and processes for expressing 
and secreting high levels of such transferases. Glycosyltransferases of eukaryotes, which are 
involved in the glycosylation of proteins, are normally located vAthin the secretory apparatus of 
the cell. They typically possess a transmembrane anchor sequence which serves to localize 
them on a membrane. The transmembrane coding sequence is preferably deleted from the 
glycosyltransferase encoding DNA sequence for expression in Aspergillus. 

The references discussed above are provided solely for their disclosure prior to the filing date of 
the instant case. Nothing herein is to be construed as an admission that the inventors are not 
entitled to antedate such disclosure by virtue of prior invention or priority based on eariier filed 
applications. 

Summary of the Inventton 

In accordance with the above objects, the invention includes novel fusion DNA sequences 
comprising from the 5' terminus: 1) a DNA sequence encoding a signal peptide functional as a 
secretory sequence in Aspergillus] 2) a DNA sequence encoding a secreted polypeptide or 
portion thereof which is normally secreted from a filamentous fungus, and particulariy from 
Aspergillus] 3) optionally a DNA sequence encoding a cieavable linker polypeptide; and 4) a 
DNA sequence encoding a desired glycosyltransferase polypeptide. When the fusion DNA 
sequence is expressed in Aspergillus, increased secretion of the desired polypeptide Is obtained 
as compared to that which is obtained when the desired polypeptide is expressed from DNA 
sequences encoding a fusion polypeptide which does not contain the second polypeptide 
normally secreted from an Aspergillus host or as compared to the production of such 
glycosyltransferase in other expression systems such as baculovims. 

The invention also includes expression vectors containing the above fusion DNA sequence and 
Aspergillus strains transformed with such expression vectors. The invention also includes the 
fusion polypeptide encoded by such fusion DNA sequences. 

Further, the invention includes a process for producing a desired glycosyltransferase 
polypeptide comprising transforming a host Aspergillus strain with the above described 
expression vector and culturing the host strain to secrete the desired polypeptide into the culture 
medium. 
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Figure 1 shows a diagram of the 3-D organization of Aspergillus glucoamyiase. 

Figures 2 A-E show the construction of pGAST and pGASTK, including Seq ID Nos. 1 and 2 in 
Figure 2A; and Seq ID Nos. 3-6, respectively, in Figure 2B. 

Figures 3 A-E show the construction of pGADVST and pGADVSTK, including Seq ID Nos. 7-10, 
respectively, in Figure 3A. 

Figure 4 shows Sheftone N concentration effect. 

Figure 5 shows effect of media pH after inoculation. 

Figure 6 shows effect of sodium citrate. 

Figure 7 shows the map of pGAKHi+, an intemnediate vector for FT expression. 
Figure 8 shows the construction of pGAKHi+FT. 

Figure 9 shows the effect of agitation speed on FT activity in KK17 transformant. 
Figure 10 shows the effect of pH on FT activity in KK17 transfomnant 
Figure 1 1 shows the constnjction of pVL1393-FT. 
Detailed Description 

The inventors have discovered that desired glycosyltransferase polypeptides can be expressed 
and secreted at levels higher than that previously obtained from other expression systems such 
as baculovims or mammalian cells by expressing the transferase as a fusion of the transferase 
with a polypeptide which is nomnaily secreted from an Aspergillus host system. 

Many glycosyltransferases useful in the present invention have been cloned. For example, 
those listed in the following Table have been cloned and described in the literature. 
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Cloned Glvcosyltransferases 

I- Fucosvltransferases E.G. Number Reference 

a1.2 2.4.1.69 (1) (2) (3) 

a1,3/4(Fnil) - (4) 

a1,3(FnV) - (5) (6) (7) (8) 

a1,3(FTV) - (9) 

a1,3(FTVI) . (10) (11) 

a1.3(FTVII) . (12) 

II. Galactosvltransferases 

a1,3 2.4.1.151 (13) (14) (15) 

a1.3 - (16) 

P1.4 2.4.1.38 (17) -(21) 

III. N-Acetvlaalactosaminvltransferases 

a1,3 . (22) (23) 

pi .4 - (24) (25) 

pi, 4 (CT antigen) . (26) 

al.Thr polypeptide - (27) (28) 

IV. N-Acetvlqlucosaminvltransf erases 

pi.2GnTI 2.4.1.101 (29) -(32) 

p1,2GnTII . (33) 

p1,4GnTIII 2.4.1.144 (34) (35) 

pi,6GnTV . (36) (37) 

p1,6GnT(0-linked) - (33) 

pi,6GnT . (39) 

GPT . (40) -(43) 

Hyaluronan synthetase - (44) 

V. Sialvltransferases 

a2,6 (N) 2.4.99.1 (45) - (50) 

a2,6 (O) 2.4.99.3 (51) 

a2.6 (O) - (52) 

a2.3 (N) 2.4.99.6 (53) (54) 

a2,3 (O) 2.4.99.4 (55) (56) 

a2,3 (O) - (57) 
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a2.3 (O/N) 



(58) (59) 



a2,8 



(60) (61) 
(62) 



STX 



VI. Glucosvltransferases 
AlgS 

Alg5 

VII. Mannosvltransferases 



(63) 
(64) 



a1,2 



(65) 
(66) 
(67) 
(68) 
(69) 
(70) 



a1.3(Alg2) 
p1.4(Alg1) 
Dpm1 



Ochi 



Pmt 



References 

(1) Ernst et al. (1989) J. Biol. Chem. 264:3436-3447 

(2) Larsen et al. (1 990) Proc. Natl. Acad. Sci. 87:6684-6678 

(3) Rajan et al. (1989) J. Biol. Chem. 264: 11 158-1 1 167 

(4) Kukowska-Latallo et al. (1990) Genes Dev. 4:1288-1303 

(5) Lowe et al. (1 990) CeH 63:475-484 

(6) Goelz et al. (1 990) CeH 63: 1 349-1 376 

(7) Lowe et al. (1 991 ) J. Biol. Chem. 266: 1 7467-1 7477 

(8) Kumar et al. (1991) J. Biol. Chem. 266-21777-21783 

(9) Weston et al. (1 992) J. Biol. Chem. 267:41 52-41 60 

(1 0) Weston et al. (1 992) J. Biol. Chem. 267:24575-24584 

(1 1 ) Koszdin and Bowen (1 992) BBRC 1 87: 1 52-1 57 

(12) Natsuka et al. (1994) J. Biol. Chem. 269: 

(1 3) Larsen et al. (1 989) Proc. Natl. Acad. Sci. 86:8227-8231 (murine) 

(14) Joziasse et al. (1 989) J. Biol. Chem. 264:14290-14297 (bovine) 

(1 5) Dabkowski et al. (1 993) Transplant. Proc. 25:2921 

(16) Yamamoto et al. (1990) Nature 345:229-233 

(17) Shaper et al. (1988) J. Biol. Chem. 263:10420-10428 

(1 8) Narimatsu et al. (1 986) Proc. Natl. Acad. Sci. 83:4720-4724 

(1 9) Masri et al. (1 988) BBRC 1 57:657-663 (human) 



BNSOOCID: <W0 963671 8AIJ_> 



wo 96/36718 PCT/US96/07171 

- 10 - 

(20) D'Agostaro et al. (1989) Eur. J. Biochem. 183:21 1-217 (bovine) 

(21) Russo et al. (1990) J. Biol. Chem. 265:3324-3331 

(22) Yamamoto et al. (1 990) Nature 345:229-233 

(23) Yamamoto et al. (1990) J. Biol. Chem. 264:1 146-1 151 

(24) Nagata et al. (1992) J. Biol. Chem. 267:12082-12089 

(25) Furakawa et al. (1993) Trends GIvcoscience GIvcotech. 5:171-179 

(26) Smith and Lowe (1994) J. Biol. Chem. 269:15162-15171 

(27) Hagen et al. (1 993) J. Biol. Chem. 268: 1 8960-1 8965 

(28) Homa et al. (1993) J. Biol. Chem. 268:12609-12616 

(29) Pownall et al. (1 992) Genomics 1 2:699-704 (mouse) 

(30) Kumar et al. (1990) PNAS 87:9948-9952 (human) 

(31) Hull et al. (1991) BBRC 176:608-615 (human) 

(32) Sarkar et al. (1 991 ) Proc. Natl. Acad. Sci. 88:234-238 (rabbit) 

(33) Schaeter, unpublished 

(34) Nishikawa et al. (1992) J. Biol. Chem. 267:18199-18204 

(35) Ihara et al. (1 993) J. Biochem. 1 1 3:692-698 

(36) Bierhuizen et al. (1 993) Genes Dev. 7:468-478 

(37) Shoreiban et al. (1993) J. Biol. Chem. 268:15381-15385 

(38) Bierhuizen and Fukuda (1 992) PNAS 89:9326-9330 

(39) Fukuda, unpublished 

(40) Datta and Lehrman (1993) J. Biol. Chem. 268: 12663-12668 

(41) Zhu and Lehrman 91990) J. Biol. Chem. 265:14250-14255 

(42) Scocca and Krag (1990) J. Biol. Chem. 165:20621-20626 

(43) Rajput et al. (1 992) Biochem. J. 285:985-992 

(44) DeAngelis et al. (1993) J. Biol. Chem. 268:19181-19184 

(45) Weinstein et al. (1987) J. Biol. Chem. 262:17735-17743 

(46) Colley et al. (1 989) J. Biol. Chem. 264: 1 761 9-1 7622 

(47) Grundmann et al. (1990) Nucl. Acids Res. 18:667 

(48) Bast et al. (1 992) J. Cell Biol. 1 1 6:423-435 

(49) Hamamoto et al. (1993) Bioorq. Medic. Chem. 

(50) Kurosawa et al. (1994) Eur. J. Biochem. 219:375-381 

(51) Kurosawa et al. (1994) J. Biol. Chem. 269: 1402-1409 

(52) Kurosawa et al. (1994) submitted 

(53) Wen et al. (1992) J. Biol. Chem. 267:2101 1-21019 



BNSDOCID: <W0 963671 BA1_L> 



wo 96/36718 



- 11 - 



PCT/US96/07171 



(54) Kitagawa and Paulson (1993) Biochem. Biophs. Res. Comm. 194:375-382 

(55) Gillespie et al. (1992) J. Biol. Chem. 267:21004-21010 

(56) Lee et al. (1 993) Eur. J. Biochem. 21 6:377-385 

(57) Lee et al., J. Biol. Chem. 269:10028-10033 

(58) Kitagawa and Paulson (1994) J. Biol. Chem. 269:1394-1401 

(59) Sasaki et al. (1993) J. BioL Chem. 268:22782-22787 

(60) Weisgerber et al. (1991) GIvcobioloQv 1:357-365 

(61) J. Biol. Chem. 268:11504-11507 (rat) 

(62) Livingston and Paulson (1993) J. Biol. Chem. 268:11504-11507 

(63) Stagljov et al. (1994) PNAS 91:5977 

(64) Heesen et al. (1 994) Eur. J. Biochem. 224:71-79 

(65) Haeusler and Robbins (1992) GIvcobioloav 2:77-84 

(66) Jackson et al. (1 992) GIvcobioloav 3:357-364 

(67) Albright et al. (1 989) J. Biol. Chem. 263:7042 

(68) Oriean et al. (1988) J. Biol. Chem. 263:17499-17507 

(69) Nakayama et al, (1 992) EMBO 1 1 :251 1-251 9 

(70) Strahl-Bahlsinger et al. 

By using the fusion DNA constructions in Aspergillus as described herein, a dramatic increase in 
extracellular glycosyltransferase has been obtained versus expression in other systems. In 
some cases expression levels are approximately 10 fold in excess of that obtained in other 
expression systems. 

The increased secretion levels of glycosyltransferase are the result of expressing the 
transferase as a fusion polypeptide with a polypeptide normally secreted by a filamentous 
fungus. In the preferred embodiments, glucoamylase from A, n/ger var. awamon encoded by 
the glaA gene, including the secretory signal sequence of glucoamylase, is fused to the amino 
temninus of the transferase from which the transmembrane anchor region has been deleted. 
The presence of the glucoamylase signal sequence and all or part of mature glucoamylase 
peptide sequences facilitate the enhanced secretion of the fusion polypeptide into the culture 
medium. 

As used herein, a "fusion DNA sequence" comprises from 5* to 3* DNA encoding: a signal 
peptide; a secreted polypeptide or a portion thereof; optionally a cleavable linker polypeptide; 
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and a desired glycosyltransferase. Specifically, the fusion DNA comprises DNA encoding a 
signal peptide functional as a secretory sequence in Aspergillus. Such signal sequences 
include those from glucoamylase. a-amylase and aspartyl proteases from Aspergillus awamori, 
Aspergillus niger, Aspergillus oryzae, signal sequences from cellobiohydrolase I, 
cellobiohydrolase II, endoglucanase I, endoglucanase II from Trichoderrna, signal sequences 
from glucoamylase from Neurospora and Humicola, as well as signal sequences from 
eukaryotes, including the signal sequence from bovine chymosin, human tissue plasminogen 
activator, human interferon and synthetic consensus eukaryotic signal sequences such as that 
described by G\A/ynne (1987) supra. Particularly preferred signal sequences are those derived 
from polypeptides secreted by the expression host used to express and secrete the fusion 
polypeptide. For example, the signal sequence from glucoamylase from Aspergillus awamori is 
preferred when expressing and secreting a fusion polypeptide from Aspergillus awamori. 

In addition, the fusion DNA sequence comprises DNA encoding "secreted polypeptides" 
normally expressed from filamentous fungi. Such secreted polypeptides include but are not 
limited to glucoamylase, a-amylase and aspartyl proteases from Aspergillus awamori, 
Aspergillus nigen and Aspergillus oryzae, cellobiohydrolase I, cellobiohydrolase II. 
endoglucanase I and endoglucanase II from Trichoderma and glucoamylase from Neurospora 
species and Humicola species. As with the signal sequences, preferred secreted polypeptides 
are those v^ich are naturally secreted by the Aspergillus expression host. Thus, for example, 
when using Aspergillus awamori, preferred secreted polypeptides are glucoamylase and a- 
amylase from Aspergillus awamori, most preferably glucoamylase. 

Fusion DNA sequences may further comprise DNA sequences encoding a cleavable linker 
polypeptide. Such sequences include those which encode the prosequence of bovine 
chymosin. the prosequence of subtilisin, prosequences of retroviral proteases, including human 
immunodeficiency virus protease and DNA sequences encoding amino acid sequences 
recognized and cleaved by trypsin, factor Xa collagenase, clostripin, subtilisin, chymosin. fungal 
KEX2 protease and the like. See, e.g., Marston. F.A.O. (1986) Biol. Chem. J. 240:1-12. Such 
linker sequences may also encode the amino acid methionine which may be selectively cleaved 
by cyanogen bromide. It should be understood that the linker sequence need only encode that 
amino acid sequence which is necessary to be recognized by a particular enzyme or chemical 
agent to bring about cleavage of the fusion polypeptide. Thus, the entire prosequence of, for 
example, chymosin or subtilisin need not be used. Rather, only that portion of the prosequence 
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which is necessary for recognition and cleavage by the appropriate enzyme is required. For 
example, the linker may comprise a recognition sequence (Lys-Arg) for the key protease which 
sequence may be inserted at the fusion junction as in pGASTK. 

Furthemiore. the fusion DNA sequence comprises a "desired glycosyltransferase polypeptide." 
Such desired polypeptides include any glycosyltransferase as provided in the Table above. 
Preferred glycosyltransferases useful in the present invention include a-(2,3) sialyltransferase; 
p-(1.4) galactosyltransferase and a-(1,3) fucosyltransferase. 

The above-defined DNA sequences encoding the corresponding amino acid sequences are 
combined to form a "fusion DNA sequence." Such fusion DNA sequences are assembled in 
proper reading frame from the 5* terminus to 3' terminus in the order presented above. As so 
assembled, the DNA sequence will encode a "fusion polypeptide" encoding from its amino 
temiinus a signal peptide functional as a secretory sequence in Aspergillus, a secreted 
polypeptide or portion thereof nomnally secreted from Aspergillus, a cleavable linker polypeptide 
and a glycosyltransferase. 

As indicated, the DNA encoding a signal peptide is functional as a secretory signal in an t 
Aspergillus regardless of whether the signal peptide is from an Aspergillus or from some other 
source such as the chymosin signal from bovine species. The signal sequences may be ... 
derived from a secreted polypeptide from a particular species of Aspergillus. 
As indicated, all or part of the mature sequence of the secreted polypeptide is used in the . 
constixjction of the fusion DNA sequences. In a preferred embodiment of the present invention, 
the secreted polypeptide is glucoamylase from A. niger or A, n/gervar. awamori. 

Aspergillus niger and Aspergillus n/gervar. awamori {A, awamori) glucoamylases have 
identical amino acid sequences. The glucoamylase is initially synthesized as 
preproglucoamylase. The pre and pro regions are removed during the secretion process 
so that mature glucoamylase is released to the external medium. Two forms of mature 
glucoamylase are recognized in culture supematants, GAI which is the full-length form 
(amino acid residues 1-616) and GAI I which is a natural proteolytic fragment comprising 
amino acid residues 1-512. GAI is known to fold as two separate domains joined by an 
extended linker region. The two domains are the 471 residue catalytic domain (amino 
acids 1-471) and the 108 residue starch binding domain (amino acids 509-616), the linker 
region being 36 residues in length (amino acids 472-508). GAII lacks the starch binding 



SNSDOCtD: <WO 963671 8A1 J _> 



wo 96/36718 PCT/US96/07171 

- 14 - 

domain. These details of glucoamyiase structure are reviewed by Libby et al. (Protein 
Engineering 7:1109-1114. 1994) and are shown in diagrammatic form in Fig. 1. 

Desired polypeptides have previously been produced in filamentous fungi as fusions with a 
naturally secreted fungal polypeptide. In each case where a comparison has been made 
between production of desired polypeptide as a fusion with a naturally secreted fungal 
polypeptide versus production of the desired polypeptide alone (i.e., not fused to another 
polypeptide), a significant increase in yields of secreted desired polypeptide resulted from 
production as a fusion protein. Bovine chymosin and porcine pancreatic prophospholipase 
A2 have both been produced in A. nigerorA. n/ger var awamon as fusions to full-length 
GAI (USSN 08/318,494; Ward et al., Bio/Technoloav 8:435-440.1990; Roberts et al., Gene 
122:155-161, 1992). Human interleukin 6 (hlL6) has been produced in A. nidulans as a 
fusion to full-length A. nigerGM (Contreras et al., Biotechnology 9:378-381, 1991). 
Fusions of desired polypeptides with truncated forms of glucoamyiase have also been 
produced. Unpublished work by the inventors has shown that fusion of bovine 
prochymosin to glucoamyiase residues 1-501 leads to similar yields of chymosin as fusion 
of prochymosin to full-length glucoamyiase. Hen egg white lysozyme (Jeenes et al., FEMS 
Microbiol. Lett. 107:267-272, 1993) and human lactofemn (Ward et al., Bio/Technolog y 
13:498-503, 1995) have been produced in A. niger as fusions to residues 1-498 of 
glucoamyiase and hlL6 has been produced in A. niger as a fusion to glucoamyiase 
residues 1-514 (Broekhuijsen et a!., J. Biotechnol. 31:135-145, 1993). In some of the 
above experiments (Contreras et a!., 1991; Broekhuijsen et al.. 1993; Ward et al., 1995) a 
KEX2 protease recognition site (Lys, Arg) has been inserted between glucoamyiase and 
the desired polypeptide to allow in vivo release of the desired polypeptide from the fusion 
protein as a result of the action of a native Aspergillus KEX2-like protease. 

Additionally, bovine chymosin has been produced in A. niger as a fusion with full-length 
native alpha-amylase (Komian et al., Curr. Genet. 17:203-212, 1990) and in A, oryzae as a 
fusion with truncated forms of A. oryzae glucoamyiase (either residues 1-603 or 1-511; 
Tsuchiya et al,. Biosci. Biotech. Biochem. 58:895-899, 1994). 

Thchoderma reesei produces several cellulase enzymes, including cellobiohydrotase I 
(CBHI), which are folded into two separate domains (catalytic and binding domains) 
separated by an extended linker region. Foreign polypeptides have been secreted in T. 
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reesei as fusions with the catalytic domain plus linker region of CBHI (Nyyssonen et al., 
Bio/Technology 11:591-595. 1993). 

Without intending to be limited to a particular theory^ applicants suggest that the amount of 
glucoamylase or other naturally secreted fungal polypeptide which is needed to enhance 
the production of a desired polypeptide can be surmised based on the known 3-D structure 
of glucoamylase. In the case of bovine chymosin production by A. n/ger van awamori it has 
been shown that fusion of prochymosin following the 11th amino acid of mature 
glucoamylase provided no benefit compared to production of preprochymosin (US Patent 
5.364,770). The inventors have shown (USSN 08/318,494) that fusion of prochymosin 
onto the C-terminus of preproglucoamylase up to the 297th amino acid of mature 
glucoamylase plus a repeat of amino acids 1-11 of mature glucoamylase yielded no 
secreted chymosin in A. n/ger var. awamori. In the latter case it is unlikely that the portion 
(approximately 63%) of the glucoamylase catalytic domain present in the fusion protein was 
able to fold correctly so that an aberrant, mis-folded and/or unstable fusion protein may 
have been produced which could not be secreted by the cell. The inability of the partial 
catalytic domain to fold correctly may have interfered with the folding of the attached ' 
chymosin. Thus, it is likely that sufficient residues of a domain of the naturally secreted 
polypeptide must be present to allow it to fold in its normal configuration independently of 
the desired polypeptide to which it is attached. Evans et al. (Gene 91:131-134, 1990) 
showed that a form of A. n/ger glucoamylase truncated at its C-terminus back to amino acid 
460 in the catalytic domain was not efficiently secreted by yeast. However, the relevance 
of this to secretion in Aspergillus is not known. 

A linker region between the desired polypeptide and the domain of the naturally secreted 
fungal polypeptide may be beneficial in allowing the two polypeptides to fold independently. 
However, it may be possible to dispense with the linker region altogether. In the examples 
above where a foreign polypeptide is fused to the C-terminus of full-length glucoamylase or 
alpha-amylase, there is no linker separating the desired polypeptide from the C-terminus of 
the fungal polypeptide. Linkers other than the glucoamylase or CBHI linkers mentioned 
above could be used to connect a domain of a naturally secreted fungal polypeptide and 
the desired polypeptide. The salient features of a linker region are that it forms an 
extended, semi-rigid spacer between independently folded domains. Such linker regions 
are found in several proteins, especially hydrolases such as bacterial and fungal cellulases 
and hemicellulases (reviewed in Libby et al. 1994), 
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Based on the above, either the full length secreted polypeptide or a functional portion thereof 
may be employed in the present invention. As used herein a "portion" of a secreted polypeptide 
is defined functionally as that portion of a secreted polypeptide which when combined with the 
other components of the fusion polypeptide defined herein results in increased secretion of the 
desired polypeptide as compared to the level of desired polypeptide secreted when an 
expression vector is used which does not utilize the secreted polypeptide. Generally, such 
portions of the secreted polypeptide comprise greater than 50% of the secreted polypeptide. 

Specific expression hosts include A. nidulans (Yelton, M. et al. (1984) Proc. Natl. Acad. Sci. 
USA 81:1470-1474; Mullaney, E.J. et al. (1985) Mol. Gen. Genet. 199:37-45; John. M.A. and 
Peberdy, J.F. (1984) Enzyme Microb. Techno!. 6:386-389; Tilbum et al, (1982) Gene 26:205- 
221; Ballance, D.J. et al. (1983) Biochem. Biophvs. Res. Comm. 112:284-289; Johnston, I.L. et 
al. (1985) EMBO 4:1307-1311); A. niger (KeWy, J.M. and Hynes M. (1985) EMBO 4:475-479); A. 
awamori, e.g., NRRL 31 12. ATCC 22342, ATCC 44733, ATCC 14331 and strain UVK 143f; A. 
oryzae, e.g.. ATCC 1 1490; A/, crassa (Case, M.E. et al. (1979) Proc. Natl. Acad. Sci. USA 
76:5259-5263; Lambowitz US Patent No. 4,486,553; Kinsey, J.A. and Rambosek, J.A. (1984) 
Molecular and Cellular Bioloov 4:1 17-122; Bull, J.H. and Wooton, J.C. (1984) Nature 310:701- 
704); Trichoderma reesei, e.g. NRRL 15709, ATCC 13631, 56764. 56765, 56466, 56767 and 
Trichodenva viride, e.g., ATCC 32098 and 32086. A preferred expression host is A. niger war. 
awamori in which the gene encoding the major secreted aspartyl protease has been deleted. 
The production of this preferred expression host is described in USSN 214,237 filed July 1, 
1988. expressly incorporated herein by reference. 

As used herein, a "promoter sequence" is a DNA sequence which is recognized by the 
particular Aspergillus species for expression purposes. It is operably linked to a DNA sequence 
encoding the above defined fusion polypeptide. Such linkage comprises positioning of the 
promoter with respect to the translation initiation codon of the DNA sequence encoding the 
fusion DNA sequence. The promoter sequence contains transcription and translation control 
sequences which mediate the expression of the fusion DNA sequence. Examples include the 
promoter from the A. awamori or A, n/ger glucoamylase genes (Nunberg, J.H. et al. (1984) Mol. 
Cell. Biol. 4:2306-2315; Boel, E. et al. (1984) EMBO 3:1581-1585), the Mucor miehei cattox^l 
protease gene (EPO Patent 0 215 594), the Trichoderma reese/ cellobiohydrolase I gene 
(Shoemaker, S.P. et al. (1984) European Patent Application No. EP 0 137 280 Al), the A. 
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nidu/ans trpC gene (Yelton, M. et al. (1984) Proc. Natl. Acad. Sci. USA 81:1470-1474; Mullaney, 
EJ. et al. (1985) MoL Gen. Genet. 199:37-45), the A. nidulans alcA gene (Lockington, R.A. et 
al. (1986) Gene 33:137-149), the A. nidulans tpiA gene (McKnight, G.L et al. (1986) CeH 
46:143-147), the A. nidulans amdS gene (Hynes, MJ. et al. (1983) Mol. Cell Biol. 3:1430^1439), 
and higher eukaryotic promoters such as the SV40 eariy promoter (Barclay, S.L and Meller, E. 
(1983) Molecular and Cellular Biology 3:2117-2130). 

Likewise a "terminator sequence" is a DNA sequence which is recognized by the expression 
host to temninate transcription. It is operably linked to the 3' end of the fusion DNA encoding the 
fusion polypeptide to be expressed. Examples include the temiinator from the A, nidulans trpC 
gene (Yelton, M. et al. (1984) Proc. Natl. Acad. Sci. USA 81:1470-1474; Mullaney, E.J. et al. 
(1985) Mol. Gen. Genet. 199:37-45), the A. awamori or A. n/ger glucoamylase genes (Nunberg, 
J.H. et al. (1984) Mol. Cell. Biol. 4:2306-253; Boel, E. et al. (1984) EMBO 3:1581-1585), and the 
Mucorn)iehei carboyyl protease gene (EPO Patent No. 0 215 594), although any fungal 
terminator is likely to be functional in the present invention. 

A "polyadenylation sequence" is a DNA sequence which when transcribed is recognized by the 
expression host to add polyadenosine residues to transcribed mRNA. It is operably linked to the 
3* end of the fusion DNA encoding the fusion polypeptide to be expressed. Examples include 
polyadenylation sequences from the A. nidulans trpC gene (Yelton, M. et al. (1984) Proc. Natl. 
Acad. Sci. USA 81 : 1470-1474; Mullaney, E.J. et al. (1985) Mol. Gen. Genet. 199:37-45), the A, 
awamori or A. r?/ger glucoamylase genes (Nunberg, J.H. et al. (1984) Mol. Cell. Biol. 4:2306- 
2315; Boel, E. et al. (1984) EMBO 3:1581-1585), and the Mucormiehei carboxyl protease gene 
(EP Patent 0 215,594). Any fungal polyadenylation sequence, however, is likely to be 
functional in the present invention. 

Examples 

Example 1 
Construction of pGAST and pGASTK 

These plasmids were made to express the truncated form of alpha 2,3 sialyltransferase as 
a fusion of the entire glucoamylase coding region which included the catalytic core, the 
linker region and the starch binding domain in Aspergillus n/ger var. awamori. The 
truncated form of sialyltransferase begins with leucine, the thirtieth amino acid from the 
initial methionine of the mature protein sequence. A BamHI/H/ndlll fragment of rat liver 
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alpha 2,3 sialyltransferase which included alt of the stoictural gene and some 5' and 3' 
flanking ends was subcloned into pUC219. The original clone. pGIR23ST (Ichikawa et al. 
(1992) J. Am. Chem. Soc. 114:9283-9298), contained the full length a-2,3 sialyltransferase 
gene. A BamHl fragment of the gene was provided by Cytel Corp. (San Diego, CA, USA). 
Single stranded template was made of the resulting plasmid, pUC219:ST. Oligonucleotides 
were used to introduce a unique EcoRI restriction site at the 5* region of sialyltransferase 
and a unique Xba\ site at the 3' end of the gene. The EcoRI site was located at amino acid 
number 38 and 39 of the mature protein sequence and the oligonucleotide used was 5* 
ATGGGAAGACTCGAATTCACTGATT 3' (Seq ID No. 1). An Xba\ site was introduced 17 
bases after the stop codon of ST using the oligonucleotide 5* 

GCCGGACACTGGTCTAGAGAGGCTCAC 3' (Seq ID No, 2). Site directed mutagenesis as 
described by Maniatis et al. (in Molecular Cloning, A Laboratory Manual, 1980, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, New Yori<) was used to introduce these sites into 
the single stranded template of pUC219:23ST, The correct clones of pUC219:ST Rl/Xba 
were identified by restriction analysis and verified by DNA sequencing. This plasmid was 
subsequently digested with EcoRI and Xba\ and the 1.0 kb sialyltransferase fragment was 
isolated for ligations. 

Linker oligonucleotides were designed to: 1) connect the end of the glucoamylase coding 
region from a Sail site, 31 base pairs from the 3' end of the coding region, to the truncated 
version of sialyltransferase; 2) include or not include a KEX2 protease cleavage site 
between the structural genes; and 3) reconstruct the final part of the glaA coding region 
from the Sail site to the end of glaA coding region (minus the stop codon), as well as 
reconstruct the sialyltransferase gene from leucine codon #30 to the introduced EcoRI at 
codons 38 and 39. The KEX2 cleavage site which is recognized by the KEX2 protease is 
made up of two amino acids, lysine and arginine being the favored pair for cleavage. The 
pair of complimentary oligonucleotides used with the KEX2 protease cleavage site were 5' 
TCG ACC GCG ACG GTG ACC GAC ACC TGG CGG AAG CGC CTA CAC TTA CTC CAA 
TGG GAA GAC TCG 3' (Seq ID No. 3) and 5* A ATT CGA GTC TTC CCA TTG GAG TAA 
GTG TAG GCG CTT CCG CCA GGT GTC GGT CAC CGT CGC GG 3' (Seq ID No. 4). 
The pair of complimentary oligonucleotides used to combine the two genes without the 
KEX2 protease cleavage site were as follows: 5* TCG ACC GCG ACG GTG ACC GAC 
ACC TGG CGG CTA CAC TTA CTC CAA TGG GAA GAC TCG 3* (Seq ID No. 5) and 5'A 
ATT CGA GTC TTC CCA TTG GAG TAA GTG TAG CCG CCA GGT GTC GGT CAC CGT 
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CGC GG 3' (Seq ID No. 6). Both sets of oligonucleotide linkers were annealed and 
phosphorylated before ligation into the vector; pSL1180 (Pharmacia LKB Biotech, 
Piscataway, NJ, USA) digested with Sail and EcoRI. Several clones of the resulting 
plasmids, pSL1 180:Sal-EcoR1 linker and pSL1180:Sal-EcoR1 kex2 linker, were sequenced 
and correct clones were identified. The correct clones of each were then digested with the 
restriction enzymes Sa/I and EcoRI and the linkers of both types were isolated for ligations. 

The vector. p:GAMpR was used to isolate a portion of the glucoamylase promoter, ali of the 
glucoamylase terminator pBR322, and the Aspergillus n/gervar. awamoh pyrG selectable 
marker by digesting it with the restriction enzymes Mlu\ and Xba\, yielding a 9.6 kb and a 
3.7 kb fragment. The 9.6 kb fragment was isolated for ligation. p:GAMpR was previously 
described in US Patent 5,364,770. Briefly, this plasmid contains the Aspergillus n/ger var. 
awamori U\/K143f glucoamylase promoter, glucoamylase coding region fused to bovine 
chymosin coding region, glucoamylase terminator, and the pyrG selectable marker and 
bovine chymosin in a pBR322 backbone. The 9,6 kb Mlu\/Xba\ fragment contained 1441 
bases of the 5' portion of the glucoamylase promoter, the glucoamylase terminator. pyrG, 
and pBR322. 

The vector pUC:AaGAM contained the Aspergillus n/ger var. awamori UVK143f 
glucoamylase promoter, glucoamylase structural gene, and glucoamylase terminator in a 
pUC vector This was digested with restriction enzymes Mlu\ and Sa/1 to yield a 6.3 and a 
2.7 kb fragment The 2.7 kb fragment, which contained 526 bases of the 3' end of the 
glucoamylase promoter and the whole coding region except for the final base pair, was- 
isolated for ligation. 

A four way ligation took place with all of the above isolated fragments: Mlu/Xba of 
p:GAMpR. the MluUSal\ of pUC:AaGAM. the EcoR\/Xba\ of pUC219:23ST Rl/Xbal. and 
either the Sa/-EcoRI linker or the Sa/-EcoRI kex2 linker. The ligation produced several 
correct clones which were confirmed by restriction analysis. These plasmids were named 
pGAST (no KEX2 protease cleavage site) and pGASTK (with the KEX2 protease cleavage 
site). 

The resulting plasmids were purified on a cesium chloride (CsCI) gradient in preparation for 
transformation into Aspergillus nigervar awamori. 
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Example 2 

Construction of pGADVST and pGADVSTK 
These plasmids were made to express the truncated form of alpha 2,3 sialyltransferase as 
a glucoamylase fusion in Aspergillus n/ger var. awamon similar to the constructs described 
in Example 1. However, in these constructs, the last 120 codons of the glucoamylase 
starch binding domain were omitted. Two approaches were taken to express 
sialyltransferase as a glucoamylase fusion. The first approach involved fusing the 
glucoamylase coding region from the Nhe\ site which included all of the catalytic core and 
most of the glucoamylase linker region through amino acid #496 (alanine) to the 
sialyltransferase coding region from leucine #30. The second approach was nearly 
identical to the first approach with the addition of a KEX2 cleavage site in between the two 
coding regions. The following oligonucleotide pairs were used to fuse the two coding 
regions: 5' C TAGCAAAGCTACACTTACTCCAATGGGAAGACTCG 3* (Seq ID No. 7) and 
5' AATTCGAGTCTTACCATTGGAGTAAGTGTAGCTTG 3' (Seq ID No. 8) for the constmct 
without the KEX2 protease cleavage site and 5' 

CTAGCAAGCGCCTACACTTACTCCAATGGGAAGACTCG 3* (Seq ID No. 9) and 5* 
AATTCGAGTCTTACCATTGGAGTAAGTGTAGGCGCTTG 3* (Seq ID No. 10) for the 
construct which included a KEX2 protease cleavage site in between the coding regions. 
The primer pairs were annealed by adding equal volumes of the primers into a tube and 
placing the tubes in a 100*C heat block for 2 minutes. The tubes were then slowly cooled 
to room temperature. The annealed primers were then phosphorylated as described by 
Maniatis et al. (in Molecular Cloning, A Laboratory Manual, 1980, Cold Spring Harbor 
Laboratory, Cold Spring HariDor, New York). The vector pSL1 180 (Pharmacia LKB Biotech) 
was digested with Nhe\ and EcoRI at 37X for 1 hour and dephosphorylated with calf 
alkaline phosphotase (Boehringer Mannheim) for 1 hour, again at 37°C. Both sets of the 
annealed and phosphorylated linkers were ligated to the pSLIISO Nhe\/EcoR\ digested 
vector in two separate ligations and transformed into E. coli, DNA was isolated from E. colt 
colonies obtained from these ligations and restriction endonuclease digestion identified 
potential correct clones of the Nhe\/EcoR\ linkers were obtained and after DNA sequence 
analysis, correct linkers were identified. The plasmids containing the correct linkers were 
digested with Nhe\ and EcoRI and subjected to electrophoresis on a 4% GTG NuSieve 
agarose gel (FMC Corporation, Rockland, Maine). After electroelution of the DNA bands 
containing the correct linkers from agarose, ethanol precipitation, and resuspension into TE 
buffer (lOmMTris H-CI. pH 7.6, 1 mM EDTA), the linkers were now ready for use. 
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An Mlu\/Nhe\ digest was performed on pGAMpRpyr (described above) and the 2.3 kb 
fragment that included 526 bases of the glaA promoter and the glucoamylase coding region 
up to codon 496 was isolated on a 1% agarose gel and eiuted. An EcoR\/Nco\ digest of 
pUC219:23ST (described above) produced an 800 base pair sialyltransferase fragment that 
was missing the last 202 bases of the coding region. This fragment was isolated. 

pSL1180 (Pharmacia LKB Biotech) was digested with Mlu\/Nco\, dephosphorylated. gel ' 
isolated and electroeluted. With each of the above isolated fragments and linkers, two 
four-way ligations were done. One set involved the linker without the KEX2 protease 
cleavage site and the other set involved the linker with this cleavage site. The resulting two 
plasmids, pSLGADVST Part 1 and pSL:GADVSTK Part 1, were confirmed to be correct by 
restriction analysis. Both plasmids were then digested with Nhe\ and Avr\\ and an 80 base 
pair fragment was isolated from each plasmid which included the linker and 32 bases 3* 
from the EcoRI site of sialyltransferase. An Nhe\/Avf\\ digest was done on 
pSL:GASTpyrGV (refer to Figure 3C) to yield the large 1 1.3 kb vector fragment. 
pSLiGASTpyrGV was the result of subcloning a Spel and C/al fragment from pGASt-into 
pSL1180. The Spel/C/al fragment contained approximately 800 bases of the 3' end of the 
promoter, the entire glucoamylase coding region, the truncated sialyltransferase gene and 
the glaA terminator. The resulting plasmid pSLGAST was then digested with EcoRV. a- 
unique site found 250 bases from the 5' end of the terminator. A Sma\ISca\ fragment 
containing the A, nigerpyrG selectable marker was ligated into the pSL:GAST EcoRV 
digested vector resulting in pSL:GASTpyrGV. 

Two separate ligations were performed with both 80 base pair linkers from pSL:GADVST 
Part 1 and pSL:GADVSTK Part 1 into the Nhe\/Avr\\ digested pSL:GASTpyrGV. The 
resulting plasmids. after confirmation by restriction analysis, were named pSL.GADVST 
Part 2 and pSLiGADVSTK Part 2. and each contained the glucoamylase promoter from the 
Mlu\ site, the glucoamylase coding region without the starch binding domain, the linker with 
(pSLGADVSTK) and without the KEX2 (pSL:GADVST) cleavage site, the truncated 
sialyltransferase gene and the glaA terminator. Both pSLrGADVST Part 2 and 
pSLGADVSTK Part 2 were digested with Mlul and a 3.3 kb fragment was isolated 
containing glucoamylase promoter and coding region without the starch binding domain, 
and sialyltransferase coding region minus the last 50 bases. Finally. pGASTK (described 
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above) was digested with M/u\ and the 7.0 kb fragment which contained the rest of the glaA 
promoter of 526 bases, the pyKB selectable marker, and the last 50 bases of 
sialyltransferase was isolated. Both 3.3 kb Mlu\ fragments of pSL:GADVST Part 2 and 
pSLGADVSTK Part 2 were ligated into the 7.0 kb Mlul fragment isolated from pGASTK. 
the final ligation yielded pGADVST and pGADVSTK. Both expression vectors were then 
grown for large scale isolation of the plasmid in order to obtain mass quantities of purified 
plasmid for fungal transformations. 

Example 3 
Fungal Transformations 

CsCI gradient purified plasmid DNA was used to transform Aspergillus n/gervar. awamori 
dgr246 P2. dgr246 P2. described by Ward, M. et al. (1993) Appl. Microbiol. Biotech. 
39:738-743, is a strain derived from UVK143f that has been deleted of its major aspartic 
proteinase (aspergiliopepsin) gene. Briefly, this strain had undergone several rounds of 
mutagenesis selection for improved expression of a heterologous gene product. The 
transformation protocol was a modification of the Campbell method (Campbell et al. (1989). 
Curr. Genet. 16:53-56). All solutions and media were either autoclaved or filter sterilized 
through a 0.2 micron filter. Spores of dgr246 P2 were harvested off of complex media agar 
(CMA) plates. CMA contained 20 g/L dextrose, 20 g/L DifcoBrand malt extract, 1 g/L Bacto 
Peptone. 20 g/L Bacto agar, 20 ml/L of 100 mg/ml arginine, 20 ml/L of 100 mg/ml uridine. 1 
ml/L of 10 mg/ml PABA, 10 ml/L of met/bio solution, and 1 ml/L of 50 mg/ml streptomycin. 
Met/bio solution was composed of 50 g of L-methionine. 200 mg of D-biotin and distilled 
water up to 1 liter. An agar plug of approximately 1.5 cm square bearing sporulating 
mycelium was used to inoculate 100 mis of yeast extract broth with glucose (YEG) (5 g/L 
yeast extract. 20 g/L glucose, 20 ml/L 100 mg/ml arginine. 20 ml/L 100 ml/ml uridine, 1 ml/L 
50 mg/ml streptomycin). The flask was incubated at 37^'C on a shaker at 250-275 rpm, 
ovemight. The mycelia were harvested through sterile Miracloth (Calbiochem, San Diego, 
CA, USA) in a funnel and washed with 200 mis of solution A (0.8 MgS04 in 10 mM sodium 
phosphate, pH 5.8). The washed mycelia were placed in a sterile solution of 100 mg of 
Novozym234 (Novo Nordisk Industri A/S, Baagsvad, Denmark) in 20 mis of solution A. 
This was incubated at 28°C at 200 rpm for 1 hour in a sterile 250 ml bottle (Corning Inc., 
Coming. NY, USA). After incubation, the protoplasting solution was filtered through sterile 
Miracloth filters into a sterile 50 ml conical tube (Sarstedt, USA). The resulting liquid 
containing protoplasts was divided equally amongst four 50 ml conical tubes. Forty mis of 
solution B (1.2 M sorbitol, 50 mM CaCb. 10 mM Tris, pH 7.5) were added to each tube and 
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centrifuged in a table top clinical centrifuge (Damon lEC HN Sll centrifuge) at 3/4 speed for 
10 minutes. The supernatant from each tube was discarded and 20 mis of fresh solution B 
was added to one tube, vortexed, then poured into the next tube until all the pellets were 
resuspended. The tube was then centrifuged at 3/4 speed for 10 minutes. The 
supernatant was discarded. 20 mis of fresh solution B was added, the tube was centrifuged 
for 10 minutes at 3/4 speed. The wash occurred one last time before resuspending the 
wash protoplasts into solution B, at 100 ul/0. 5-1.0 X 10^ protoplasts. To each 100 ul of 
protoplasts in a sterile 15 ml conical (Sarstedt. USA), 10 ul of the transforming plasmid 
DNA was added. To this, 12.5 ul of solution C (50% PEG 4000, 50 mM CaCb, 10 mM Tris, 
pH 7.5) was added to the 15 ml tube and placed on ice for 20 minutes. One ml of solution 
C was added to the tube, removed from the ice to room temperature and mixed gently. 
Two mis of solution B were added immediately to dilute solution C. The transforming mix 
was added equally to 3 tubes of melted MMS overlay (6 g/L NaNOa, 0.52 g/L KCI, 1.52 g/L 
KH2PO4, 218.5 g/L D-sorbitol, 1.0 ml/L trace elements-LW, 10 g/L SeaPlaque agarose 
(FMC Bioproducts, Rockland, Maine, USA) 20 ml/L 50% glucose, 2.5 ml/L 20% 
MgS04*7H20, 10 ml/L met/bio solution, 2.0 ml/L 10 mg/ml PABA pH to 6.5 with NaOH) that 
were stored in a 37*'C water bath. Trace elements LW consisted of 1 g/L FeS04*7H20, 8.8 
g/L ZnS04*7 H2O, 0.4 g/L CuS04*5 H2O, 0.15 g/L MnS04*4 H2O,0.1 g Na2B4O7*10 H2O, 50 
mg/L (NH4)6 M07O24M H2O, 250 mis H2O, 200 ul/L concentrated HCI. The melted overlays 
with the transformation mix were immediately plated onto 3 MMS plates (same as MMS 
overlay recipe with the exception of 20 g/L of Bacto agar instead of 10 g/L of SeaPlaque) 
that had been supplemented with 200 ul/plate of 100 mg/ml of arginine added directly on 
top of the agar plate. After the agar solidified, the plates were incubated at 37°C until 
transformants grew. 

The sporulating transformants were picked off with a sterile toothpick onto a plate of 
minimal media + glucose (MM). MM consisted of 6 g/L NaN03, 0,52 g/L KCI, 1.52 g/L 
KH2P04. 1 ml/L Trace elements LW, 20 g/L Bacto agar, pH to 6.5 with NaOH, 25 ml/L 40 
% glucose, 2.5 ml/L 20% MgSo4*7H20, 1 ml/L 50 mg/ml streptomycin plus 10 mg/ml 
carbeniciliin, 20 ml/L 100 mg/ml arginine. Once the transformants grew on MM, they were 
transferred to a CMA plate, one transformant per plate of CMA. 
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Example 4 

Screening of Sialyltransferase Transformants 

A 1.5 cm square agar plug of each transformant was grown in 50 mis in a 250 ml shake 
flask of inoculum built-up medium, CSL + 5% xylose (100 g/L com steep liquor (50% 
solids), National), 1 g/L NaHzPO/HzO, 0.5 g/L MgS04. 50 g/L xylose. 3 g/L mazu (Mazur 
Chemicals, USA), pH to 5.8 with NaOH, 1 ml/L 50 mg/ml streptomycin plus 10 mg/ml 
carbenicillin. Transformants and the parent control dgr246 P2 were incubated at 25**C. 160 
rpm, for 2 days. Five mis of the 2 day old medium were inoculated into 50 mis of Clofine 
special (70 g/L sodium citrate. 15 g/L (NH4)2 SO4, 1 g/L NaHj P04*H20, 1 g/L MgS04. 10 
ml 10% Tween 80. 2 ml/L mazu, 60 g/L spray dried tofu TF30 soy milk powder (Armour 
Good Ingredients, Springfield, Kentucky. USA), 12% final concentration of maltose, 20 
ml/L, 100 mg/ml arginine, 20 ml/L 100 mg/ml uridine, 10 ml/L met/bio solution. 1 mg/L 
streptomycin plus 10 mg/ml carbenicillin) in a 250 ml shake flask. The Clofine Special 
flasks were incubated at 25**C. 160 rpm for up to 7 days. Assays were performed on the 
supematants of the growing cultures to determine the best producers of the enzyme of 
interest. The sialyltransferase assays were performed as described below. 

Sialyltransferase Assay 

The sialyltransferase assay was used to measure the enzymatic activity of alpha 2,3 

sialyltransferase that catalyzed the following reaction: 

CMP-sialic acid > CMP + 

+ Lacto-N-Tetraose sialyl lacto-N-tetraose 

The sialylated product was separated from unsialylated material on a column of ion 

exchange resin. Use of [''^cj-labeled substrate allowed measurement of the sialylated 
product by liquid scintillation counting. The resin used in the columns (Poly-prep 
chromatography columns 0.8 x 4 cm, Bio-Rad, Hercules. CA, USA) was AG1-X8, 200-400 
mesh, chloride form (Bio-Rad). The resin was prepared by measuring out 100 g of the 
resin and then adding 400 mis of 1 M NaOH. The liquid was decanted and washed with 3 
liters of distilled water. After the water was decanted, 400 mis of 1 M phosphoric acid was 
added, stirred and the resin was allowed to settle. The phosphoric acid was then decanted 
and the resin was washed with 3 liters of distilled water. After the water was decanted, 400 
mis of 0.1 M sodium phosphate buffer. pH 6.8, was added and the resin and buffer slurry 
were further adjusted to pH 6.8. This was then washed a final time with 3 liters of distilled 
water and a final 50% slurry was made with MilliQ water. The slurry of prepared resin was 
swiried and 1 ml of the slurry was pipetted into a Bio-Rad column and allowed to settle. 
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The assay mix was made up of 0.5 mM cytidine 5' monophosphate sialic acid (CMP-SA) 
(Boehringer Mannheim, Indianapolis, IN, USA), 12.3 mM sodium cacodylate buffer, pH 6.0 
(Sigma. St, Louis, MO, USA). 1.25% Triton CF-54 (Sigma), and 1.8 mg/ml bovine serum 
albumin (Sigma). The acceptor molecule, lacto-N-tetraose (LNT) (Sigma) was made to a 7 
mM final concentration. The column buffer used to wash the resin was 5 mM sodium , 
phosphate buffer, pH 6.8. The concentration of the assay mix was measured after 
dissolving the dry NEN-619 cytidine 5' monophosphate sialic acid (CMP-[14C]-SA ) (NEN 
Research Products, Dupont. USA) with 0.65 ml of the non-radioactive 5.76 mM CMP-SA. A 
1:100 dilution of this was measured at OD270 against a dH20 blank. The CMP-SA 
concentration was calculated as follows: 



tCMP-SA concentration (Molar) = (OD270)(1 mol/8600)(dilution factor)] 

The assay mix was diluted so that when 25 ul of reaction mix were added to each sample, 
each sample received the equivalent of 3.4 ul of the above mixture. To the assay mixture, 
960 ul 1 M NaCacodylate, 480 ul 20% Triton CF-54 and 290 ul 50 mg/ml BSA (filter 
sterilized) was added. Three hundred ul aliquots of this final mixture were stored at -20X. 
The reaction mix was prepared by adding 250 ul of assay mix, 70 ul of 7 mM LNT, 12 ul of 
1 M MnCI and 168 ul dH20 to an Eppendorf tube. 

For each sample to be tested, 25 ul of reaction mix were added for each 14 ul of sample 
(supernatant from spent broth, standards and a buffer blank) to an Eppendorf tube. The 
tubes were spun down momentarily'^in a microcentrifuge and incubated in a 37**C water 
bath for 10 minutes. The standard alpha 2,3 sialyltransferase was provided by Cytel 
Corporation (San Diego. CA, USA) at 1 unit/ml. From this, dilutions using the sodium 
phosphate buffer were made for a range of 20-140 milli units, to generate a standard curve. 
Dilutions of the spent supematants were also made in the sodium phosphate buffer. After 
incubation, 1 ml of column buffer was added to each tube and each sample was passed 
over an equilibrated column of resin. The unbound product was collected into a scintillation 
tube, washed with another 1 ml of column buffer, collected again and pooled. To each 
tube, 5 mis of aqueous scintillation fluid (ICN Ecolume, Costa Mesa. CA. USA) were added, 
shaken vigorously and counted in a liquid scintillation counter (TM Analytical) for 1 minute. 
The formulae required for the calculations of sialyltransferase activity units were as follows: 
a) Total nmol CMP-SA in reaction = CMP-SA X 3.4 ul 

where CMP-SA concentration = molar concentration as determined above; 
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b) cpm/nmol = total reaction cpm/nmol CMP-SA 

where the total reaction cpm = total counts added to the reaction, corrected 
for blank and nmol CMP-SA = total nmol CMP-SA in the reaction as 
calculated in a); and finally 

c) Activity (units/ml) = umol/min/ml = 

[(sample cpm)/(cpm/nmol)]{1/10min)(dilution factor)(1/0.014ml)(1.6)(1 
umol/1000 nmol) 

sample cpm = average cpm of sample corrected for average blank cpm 
10 minutes = reiaction incubation time 
0.005 = volume of enzyme dilution tested 

1.6 = unit definition was specified at saturating substrate concentration. 
Since the standard assay contained a subsaturating level of LNT. activity 
was multiplied by a factor of 1.6 to reflect maximal velocity by analysis of 
kinetics. 



Example 5 
Screening of ST Transformants 

Following the transformation methods detailed in Example 3, twenty transformants with the 
pGAST expression vector, 22 with the pGASTK expression vector. 9 with the pGADVST 
expression vector^ and 12 with the pGADVSTK expression vector were screened in shake 
flasks. Using the sialyltransferase activity assay of Example 4, the following transformant 
exhibited the best sialyltransferase levels secreted by transformants with each expression 
vector for pGAST transfomiant #20, 270 units/liter, for pGASTK transformant #4, 210 
units/liter, for pGADVST#6, 120 units/liter and for pGADVSTK #1 , 543 units/liter. Media 
optimization work as described in Example 6 was done with pGAD\/STK#1 (DVK1). 



Example 6 

Media and Process Development for the Production 
of Sialyltransferase in Aspergillus 

a. Inoculum Medium 

The medium used for the generation of cell mass was composed of: 100 g/L corn steep 
liquor; 0.5g/L magnesium sulfate; 1 g/L monobasic sodium phosphate monohydrate; 3 ml/L 
Mazu; 50 g/L xylose. The medium pH was adjusted to 5.8. The inoculum medium was 
inoculated with the desired culture and incubated at 25°C for 48 hr at 250 rpm. After the 
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completion of the incubation time a given volume would be transferred to the production 
stage. 

b. Production Medium 

The initial medium used for the media development work was a liquid production medium, 
"Clofine Special." Clofine Special contained 60 g/L spray dried soy milk; 15 g/L ammonium 
sulfate; 70 g/L sodium citrate; 1 g/L magnesium sulfate; 1 ml/L Tween 80; 1 ml/L Mazu; 0.5 
g/L potassium chloride; 1 g/L monobasic sodium phosphate monohydrate; 2 g/L arginine; 2 
g/L uridine; and 120 g/L maltose. The pH of the medium was adjusted to 6.2. The 
production media was inoculated with 5 mis of the desired culture, previously grown in the 
inoculum media. The production media was incubated the same as the inoculum media, 
except for a longer period of time. 

X-Stat, a statistical design software, was used to conduct a number of experiments directed 
at the optimization of the production of glycosyltransferase, and particulariy ST. After 
several experiments using X-Stat a new media evolved. The new medium contained: 
KH2PO4. 1 g/L; MgS04, 0.8 g/L; KCI, 1 g/L; arginine, 4 g/L; Tween 80. 1 ml/L; Mazu, 13 
ml/L; spray dried soy milk, 60 g/L; maltose, 100 g/L; glucose, 10 g/L; ammonium sulfate, 
13 g/L; sodium citrate, 60 g/L. 

It was determined that the ST was aggregating to the spray dried soy milk in the media. 
Experiments were designed to look for an alternative complex nitrogen source for the, 
medium. The new medium recipe described above was used to examine different nitrogen 
sources by substituting spray dried soy milk with wheat flour, Primetone CLT, Primetone 
SG, com gluten and Sheftone. DVKI was grown in media containing the various nitrogen 
sources and it was determined that Sheftone N provided a good source of complex 
nitrogen, and additionally added stability to the ST produced in the medium. Thus, this 
Sheftone N-containing medium had the advantage of improving the expression of the ST, 
as well as provided advantages during the recovery process of the protein (less 
degradation during recovery). Additional experiments followed this finding. Experiments 
were set up to study different concentrations of Sheftone N in the media. Results indicated 
that various concentrations of Sheftone N resulted in good expression, while 
concentrations from about 40 g/L up to about 100 g/L (preferably ^80 g/L of Sheftone N) 
resulted in increased production of ST. The data are shown in Figure 4. 
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Additionally, during media optimization it was determined that the pH of the medium had a 
very important effect on the production and stability of the ST produced. Experiments 
studying pH were conducted. Results indicated that by increasing the pH of the medium 
over about pH 6 up to 8, increased levels of ST were produced. See Figure 5. 

c. Buffer Concentration Studies 

Based on the results shown in Figure 5, which indicated the pH of the medium had an 
effect on the production of ST. and knowing that the desired pH range for the medium was 
neutral, an experiment was conducted to study the buffer capacity of the medium. Sodium 
citrate which was present in the medium at an initial concentration of 60 g/L (see above) 
was increased up to at least 120 g/L. The results, as shown in Figure 6. indicated that 
stronger buffering capacity in the medium gave higher production of ST. 

d. Process Development/14 L Fermentation Experiments 

In order to scale up the results described above. Sheftone N-containing media (without the 
spray dried soy milk) at a concentration of 100 g/L Sheftone N and 60 g/L sodium citrate 
was used. The pH was adjusted to 6.5 and controlled at 5.8 with ammonia hydroxide. The 
fermenter produced 988 units/ml of ST. The yield from the fermentation came very close to 
the theoretical results based on the work presented here of 1057 units/ml. This represents 
a 93.5% duplication efficiency. 

Example 7 
Construction of pGAKHi+ 
The Expression Vector Used to Express Fucosyltransferase 

An expression plasmid was constructed using methods known in the art. which allows DNA 
sequences encoding desired polypeptides to be fused to the C-terminus of the catalytic 
core and part of the linker region of Aspergillus glucoamylase. 

The plasmid pGAKHi+ was constructed based on the bacterial plasmid pSL1 180 
(Pharmacia LKB). This plasmid is similar to the pUC series of phagemids such as pUC118 
but has an extended multiple cloning site (MCS) containing all possible six base pair 
palindromes including two recognition sites for C/al, one of which is methylated and the 
other not methylated by £. coli dam methylase. An expression cassette was inserted into 
the MCS of pSL1 1 80 so that the first 28 bp of the MCS from H/ndill to Spel was present at 
one end. Beginning at the Spel recognition site, the expression cassette contains the 
following components: the promoter and 5' flanking region of the glaA gene of A. n/ger van 
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awamori strain UVK143f (US Patent 5,364.770) starting at a Spe\ site approximately 1 kb 
upstream from the translation start codon; the coding region of the giaA gene from strain 
UVK 143f from the start codon to an Nhe\ site, the six bp which encode amino acids 497 
and 498 in the linker region of mature glucoamylase; a synthetic piece of double stranded 
DNA with Nhe\ and BstEW ends; approximately 250 bp of the terminator region of the glaA 
gene from A. n/ger strain #7 (US Patent 5,364,770) beginning at a BstEW site 20 bp 5' of 
the translation stop codon and ending at an EcoRV site; an approximately 2.4 kb fragment 
containing the A. nigerpyrG gene (Wilson et al., 1988) obtained as a Smal to Seal 
fragment from pBH2 (Ward et al., 1989) ( the Smal site exists within the pUC18 MCS of 
pBH2 and the Seal site exists 65 bp from the end of the Say77Hi-H/ndlll fragment of A. niger 
DNA present in pBH2); approximately 1.3 kb of 3' flanking DNA from the glaA from A. niger, 
beginning at the EcoRV site 250 bp 3' of the translation stop codon and ending at a C/al 
site. After the C/al site there exists 75 bp of the pSL1 180 MCS from C/al (the site not 
methylated by E, coli dam methylase) to H/ndlll, followed by bp of pBR322 from H/ndlll to 
C/al. followed by 198 bp of the pSL1180 MCS from C/al (the site methylated by E. coli dam 
methylase) to EcoRI. 

As shown in Figure 7 this expression vector (pGAKHi+) contains unique Nhe\ and BstEW 
sites which can be utilized to insert a DNA fragment encoding a desired polypeptide such 
that a fusion protein (comprising the glucoamylase catalytic core, part of the glucoamylase 
linker region and the desired polypeptide) would be expected to be produced after 
transformation into Aspergillus strains. The pyrG gene was included as a selectable marker 
for transformation into Aspergillus strains which are pyrG null mutants. 

Example 8 

Construction of pGAKHi-FT 

Source of the FT cDNA 

The human a1,3 fucosyltransferase V (FT) cDNA was obtained from Dr. John Lowe at the 
University of Michigan. The clone pcDNAI-FT contains the full-length FT sequence in 
pcDNAI vector (Weston, B.W.; Nair, R.P.; Lowe, J.B. (1991) J. Biol. Chem. 267:4152-60). 
The full-length clone encompasses an amino-terminal cytoplasmic tail, a transmembrane 
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signal anchor, a stem region, and a carboxy-terminal catalytic domain as typical of 
glycosyltransferases (Paulson. J.C.; Coliey, K.J. (1989) J. Biol. Chem. 264:17615-17618.) 

Construction of pGAKHi-FT Plasmid 

The construct was designed so that the A. n/ger glucoamylase gene would be fused in 
frame to the catalytic domain of the FT gene. The FT gene was amplified using native Pfu 
polymerase (Stratagene) and the polymerase chain reaction (PGR) using Buffer #1 and 
conditions supplied by the manufacturer. The primers generate an Nhel restriction site at 
the 5' terminus and a BstEII site at the 3' temiinus. The 5' primer was designed to 
hybridize to the stem portion of the FT gene, which fuses in-frame to the A, niger 
glucoamylase sequence of pGAKHi+ (Example 8), with a kBx2 proteolytic cleavage site 
between the glucoamylase and FT genes. The 3' primer was designed to amplify the 3' 
end of the FT coding region including the stop codon. The 5' and 3' primers were 5'-CTA- 
GCT-AGC-AAG-CGC-GGA-TCC-CCT-AGG-CCA-3' (Seq ID No. 11) and 5-CCC-GGT- 
GAC-CTC-AGG-TGA-ACC-AAG-CC-3* (Seq ID No. 12), respectively. The PGR procedure 
amplified the stem region and catalytic domain of the FT gene, yielding a 1.2 kb fragment. 

The amplified FT gene was sequentially digested with the enzymes Nhe I and BstEII (New 
England Biolabs), using conditions provided by the manufacturer, and fractionated on an 
agarose gel. The 1.2 kb fragment was isolated using a Prep-A-Gene kit (BioRad) and 
ligated under standard conditions into the unique Nhel/BstEII sites of plasmid vector 
pGAKHi+. This fused the stem region of the FT gene in-frame with the glucoamylase 
sequence, generating pGAKHi-FT (Figure 8). Plasmids were transformed into E. coli strain 
DH5a, and ampicillin-resistant colonies were screened for the presence of the FT gene. 
The glucoamylase-FT fusion junction was sequenced using a primer that hybridizes to the 
glucoamylase sequence (5*-GCG-AGT-ATG-GTG-GCT-AGT-GGG-GGG-3' (Seq ID No. 
13)). The plasmid of the correct construct was purified using Qiaex matrix (Qiagen). 

Example 9 
Screening of FT Transformants 
Transfonnants were grown on agar plates of MM containing 5% xylose for 8 days at 37''C 
to generate spores. The spores were harvested in 0.1% Tween/20% glycerol and stored at 
-70**C until used as stocks. 
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Spores (0.3 ml) from stocks were inoculated into 250 ml baffled flasks containing 50 mis of 
CSL + 5% xylose; cultures were grown at 37*'C, 130 rpm for 2 days. This culture was used 
as the inoculum for the production medium. 

Two production media, Clofine Special (Example 7) and Sheftone N (Example 7), were' 
used for screening transformants. A 10% inoculum of CSL culture was inoculated into 50 
mis of production medium in 250 ml baffled flasks. Cultures were grown at 32*'C, 130 rpm 
for 4 days, at which time supematants were assayed. 

The protocol for the FT assay is provided below and is available from Cytel, San Diego, CA. 
The assay measures the FT enzymatic activity and catalyzes the following reaction: GDP- 
fucose + N-acetyllactosamine — > GDP + fucosyl N-acetyllactosamine. The fucosyfated 
product was separated from GDP-fucose on an ion exchange column resin (AG1 X 8. 
chloride form). Use of ^"^C-labeled substrate (GDP-f^'^C] fucose) allowed measurement of 
the fucosylated product by liquid scintillation counting. 



Fucosvltransferase Assay Protocol 
Cocktail Mix 

70 ul [^'*C]GDP-fucose (ca. 1.5 x 10® cpm) Evaporate with N2 
5.5 ul cold GDP-fucose (6 mg/ml) 
35 ul 1 M Na Cacodylate. pH 6.0 
13ul 1 M MnCl2 

130 ul 100 mM N-acetyllactosamine 
400 ul dH^O 
584 ul total 



Final Concentration 

0.3 nmoles 
4.0 nmoles 
50 mM 
20 mM 
20 mM 



Assay 

45 ul cocktail mix 
5 ul diluted enzyme 
50 ul total per tube 



Preheat tube with 45 ul reaction mix, 37'*C for ca. 5 min 
Add 5 ul diluted enzyme; time reactions 
Incubate 10 min at 37X 
Terminate with 1 ml dH20 



Run through AG1-X8 (chloride form) column (1.5 ml of 50:50 slurry) 

Wash 1 time with 1 ml dH20 

Count 



- Run negative control 
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- Count 45 ul cocktail mix control for determining specific activity 

Example 10 

Media Optimization for Increasing FT Expression 
Various parameters were tested for increasing FT expression in transformant KK17. 
Temperature, pH, incubation time, agitation speed, inoculum size, and soy protein source 
were among the variables tested. Several of these parameters affected the level of FT 
expression. For example, the optimum agitation speed for FT activity was about 180-220 
rpm (Figure 9). In addition, pH affected expression, having an optimum value at pH 7.0 
(Figure 10). Doubling the inoculum size from 10 to 20% increased activity at pH 7.0, 220 
rpm from 580 U per liter to 690 U per liter 



Example 1 1 

Recombinant Baculovirus Expression System 
Construction of Recombinant Baculovirus 

The recombinant baculovirus expressing ST was constructed as described in Ichikawa et 
al. (1992. J. Am. Chem. Soc. 114:9283-9298). The gene was fused to the human insulin 
leader sequence, which allows the glycosyitransferase to be secreted. The baculovirus 
construction made for FT expression was similar, except the cDNA was cloned into the 
vector pVL1393, as illustrated in Fig. 11. 

Stock cultures of Spodoptera frugoperda insect cells (Sf9ll; Gibco/BRL) were grown at 
27.5''C in 2-liter Fembach flasks containing 1 liter of Sf900 II serum-free media 
(Gibco/BRL). Circularized pVL1393-FT or pBlueBac-ST (2 mg) was co-transfected with 
linearized BaculoGold viral DNA (0.25 mg; PharMingen) into Sf9ll cell monolayers using the 
transfection protocol of Invitrogen. The media containing recombinant virus particles was 
harvested after 5 days incubation at 27**C. To isolate single viral clones, 2.5 x 10^ Sf9ll 
cells were infected with serial 5-fold dilutions (12 each) of virus in 96-well limiting dilution 
plates (O'Reilly, D.R., Miller, LK., Luckow, V.A., in Baculovirus Expression Vectors, A 
Laboratory Manual, W.H. Freeman and Co., 1992). Vinjs from wells exhibiting cell lysis at 
the end-point dilution were amplified by infecting 5x10^ cells with 0,2 pfu per cell in 24-well 
dishes. The amplified clones were then tested for activity: 5x10^ cells were infected with 
3-5 pfu per cell and dishes were incubated for 48 hrs. All of the clones were positive for 
activity, and one clone was used for the generation of the viral stock (passage 2), which 
was made in monolayer as follows: 3.0 x 10^ cells were absorbed to a T225 cm^ tissue 
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culture flask and infected with 0.2 pfu virus per cell for 1 hr at room temperature. Fresh 
Sf900ll media (30 ml) was then added, and flasks were incubated for 48 hrs at 27.5'*C. 
Virus was harvested by centrifugation (3700 rpm, 20 min). Supernatants were stored at 4C 
and used to make virus stocks for large scale protein production. Virus titer was 
determined by limiting dilution assay and was typically ca. 3 x 10^ pfu per ml. 

Protein Production 

Virus stocks (passage 3) were produced in shake flasks as follows: Sf9ll cells were grown 
to 8 X 10® cells per ml in 2 L Fernbach flasks containing 0.5 L of Sf900ll media. Virus from 
passage 2 was added at 0.2 pfu per cell; another 0,5 L of fresh Sf900ll media was added, 
and flasks were incubated in shaking incubators (27.5X. 130 rpm). Virus was harvested at 
48 hrs by centrifugation (3700 rpm, 20 min), and supernatants were checked for 
contamination and stored at 4*C. Virus titers were determined by limiting dilution assay 
and were typically 2x10® pfu per ml. 

For protein production, Sf9 II cells (1 L) were grown at 27.5X in 2 L cotton-stoppered 
Fernbach flasks to ca. 8x10® ceils per ml. Cells were gently centrifuged (1000 rpm, 5 min) 
in sterile 1 liter bottles; supernatants were carefully dumped and virus was added to ca. 5-7 
pfu per cell (ca. 25-50 ml). Fresh Sf900ll medium was immediately added to restore the 
original volume, and cells were shaken at 27.5°C. Supernatants were harvested by 
centrifugation (3700 rpm; 15') at 48 hrs post-infection for ST or 72 hrs for FT. 

Example 12 

Comparison of A. n/ger Expression System to Baculovirus Expression System 
Baculoviruses are insect-specific viruses that receive much interest for their use as 
recombinant protein expression systems. It is suggested to have a high rate and yield of 
expression, so it was desirable to compare it to the A, n/ger system described in this patent. 

ST and FT activities in the two expression systems are shown in Table 2, which are the 
maximum activities over time. 

Table 2 

Maximum FT and ST Activities in a Recombinant Baculovirus Expression System 
as Compared to the A. niger Expression System 

System ST (U/Liter) FT (U/Liter) 
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Baculovinjs 
A. niger 



140 
1700 



50 
600 



The recombinant ST baculovirus produced 140 U/liter, while the A, n/ger transformant 
produced 1700 U/liter. Likewise, the FT recombinant baculovirus expressed 50 U/liter, 
while FT A, n/ger transformant produced 600 U/liter. Both genes expressed 12-fold higher 
activities in A. niger than the corresponding baculovirus system. 

The foregoing are presented by way of example only and should not be construed as a 
limitation to the scope of permissible claims. 

Having described the preferred embodiments of the present invention, it would appear to 
those ordinarily skilled in the art that various modifications may be made to the disclosed 
embodiments, and that such modifications are intended to be within the scope of the 
present invention. 

All references are expressly incorporated herein by reference. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Kodama, Katherine 

(B) STREET: 180 Kimball Way 

(C) CITY: So. San Francisco 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 94080 

(G) TELEPHONE: (415) 742-7536 

(H) TELEFAX: (415) 742-7217 

(A) NAME: Ward, Michael 

(B) STREET: 180 Kimball Way 

(C) CITY: So. San Francisco 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 94080 

(G) TELEPHONE: (415) 742-7536 

(H) TELEFAX: (415) 742-7217 

(ii) TITLE OF INVENTION: Expression of Glycosyltransf erase in 
Aspergillus 

(iii) NUMBER OF SEQUENCES: 13 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
ATGGGAAGAC TCGAATTCAC TGATT 25 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GCCGGACACT GGTCTAGAGA GGCTCAC 27 
(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TCGACCGCGA CGGTGACCGA CACCTGGCGG AAGCGCCTAC ACTTACTCCA ATGGGAAGAC 60 
TCG 63 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
AATTCGAGTC TTCCCATTGG AGTAAGTGTA GGCGCTTCCG CCAGGTGTCG GTCACCGTCG 
CGG 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TCGACCGCGA CGGTGACCGA CACCTGGCGG CTACACTTAC TCCAATGGGA AGACTCG 57 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



60 
63 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AATTCGAGTC TTCCCATTGG AGTT^GTGTA GCCGCCAGGT GTCGGTCACC GTCGCGG 57 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CTAGCAAAGCTACA CTTACTCCAA TGGGAAGACT CG 36 
(2) INFORMATION FOR SEQ ID NO: 8: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
AATTCGAGTC TTACCATTGG AGTAAGTGTA GCTTG 35 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CTAGCAAGCG CCTACACTTA CTCCAATGGG AAGACTCG 3 8 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AATTCGAGTC TTACCATTGG AGTAAGTGTA GGCGCTTG ' 38 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
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CTAGCTAGCA AGCGCGGATC CCCTAGGCCA 3 0 

(2) INFORKATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CCCGGTGACC TCAGGTGAAC CAAGCC 2 6 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCGAGTATCG TGGCTACTGG CGGC 2 4 
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What Is Claimed Is : 

1 . A fusion DNA sequence encoding a fusion polypeptide comprising from the 5* end 
of said fusion DNA sequence: 

a) DNA encoding a signal peptide functional in Aspergillus: 

b) DNA encoding a secreted polypeptide or portion thereof normally secreted 
from Aspergillus] 

c) optionally DNA encoding a cleavable linker polypeptide; and 

d) DNA encoding a desired glycosyltransferase. 

2. A fusion DNA sequence of Claim 1 wherein the desired glycosyltransferase is 
selected from the group consisting of sialyltransferase, galactosyltransferase and 
fucosyltransferase. 

3. A fusion DNA sequence of Claim 1 wherein the signal peptide is selected from the 
group consisting of signal peptides from glucoamylase, a-amylase, and aspartyl 
protease from Aspergillus species and signal peptides from Trichoderma 
cellobiohydrolase I and II, and endoglucanase I and 11. 

4. A fusion DNA of Claim 3 wherein the signal peptide is the signal peptide from A. 
n/ger var. awamon glucoamylase. 

5. A fusion DNA of Claim 1 wherein the secreted polypeptide or portion thereof is 
glucoamylase from Aspergillus. 

6. A fusion DNA of Claim 5 wherein the secreted polypeptide is the full length mature 
glucoamylase from A. niger war. awamorf. 

7. A fusion DNA of Claim 5 wherein the secreted polypeptide is a portion comprising 
50% of glucoamylase from A, niger war. awamori. 

8. A fusion DNA of Claim 1 wherein the signal peptide comprises the signal peptide of 
A. niger var. a wamon glucoamylase, the secreted polypeptide or portion thereof 
comprises glucoamylase from A, niger war. awamori, and the desired transferase is 
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selected from the group consisting of sialyltransferase, galactosyltransferase and 
fucosyltransferase. 

9. An expression vector for transforming an Aspergillus host cell comprising DNA 
sequences encoding regulatory sequences functionally recognized by the 
Aspergillus host, including promoter and transcription and translation initiation 
sequences operably linked to the 5* end of the fusion DNA sequence of Claim 1 and 
transcription stop sequences and polyadenylation sequences operably linked to the 
3' end of said fusion DNA sequence. 

10. An Aspergillus containing an expression vector of Claim 9. 

11. A fusion polypeptide comprising from the 5' end: 

a) an amino acid sequence encoding a signal peptide functional in Aspergillus] 

b) an amino acid sequence encoding a secreted polypeptide or portion thereof 
normally secreted from Aspergillus: 

c) optionally an amino acid sequence encoding a cleavable linker; and 

d) an amino acid sequence encoding a desired glycosyltransferase. 

12. A fusion polypeptide of Claim 1 1 wherein the desired glycosyltransferase is selected 
from the group consisting of sialyltransferase, galactosyltransferase and 
fucosyltransferase. 

13. A fusion polypeptide of Claim 1 1 wherein the signal peptide is selected from the 
group consisting of signal peptides from glucoamylase, a-amylase, and aspartyl 
protease from Aspergillus species and signal peptides from Trichoderma 
cellobiohydrolase t and II, and endoglucanase I and 11. 

14. A fusion polypeptide of Claim 13 wherein the signal peptide is the signal peptide 
from A, niger y/ar, a wamon glucoamylase. 



15. 



A fusion polypeptide of Claim 1 1 wherein the secreted polypeptide or portion thereof 
is glucoamylase from Aspergillus, 
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16. A fusion polypeptide of Claim 15 wherein the secreted polypeptide is the full length 
mature glucoamylase from A. nlger^Bx. awamori. 



M, A fusion polypeptide of Claim 15 wherein the secreted polypeptide is a portion 
comprising 50% of glucoamylase from A. n/gfervar awamori. 



18. A fusion polypeptide of Claim 1 1 wherein the signal peptide comprises the signal 
peptide of A, n/ger var. awamori glucoamylase, the secreted polypeptide or portion 
thereof comprises glucoamylase from A. n/ger var, awamori, and the desired 
glycosyltransferase is selected from the group consisting of sialyltransferase, 
galactosyltransferase and fucosyltransferase. 
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GAM-ST link Sal1-EcoR1 no kex 
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